KAAS: an automatic genome annotation and pathway reconstruction server

نویسندگان

  • Yuki Moriya
  • Masumi Itoh
  • Shujiro Okuda
  • Akiyasu C. Yoshizawa
  • Minoru Kanehisa
چکیده

The number of complete and draft genomes is rapidly growing in recent years, and it has become increasingly important to automate the identification of functional properties and biological roles of genes in these genomes. In the KEGG database, genes in complete genomes are annotated with the KEGG orthology (KO) identifiers, or the K numbers, based on the best hit information using Smith-Waterman scores as well as by the manual curation. Each K number represents an ortholog group of genes, and it is directly linked to an object in the KEGG pathway map or the BRITE functional hierarchy. Here, we have developed a web-based server called KAAS (KEGG Automatic Annotation Server: http://www.genome.jp/kegg/kaas/) i.e. an implementation of a rapid method to automatically assign K numbers to genes in the genome, enabling reconstruction of KEGG pathways and BRITE hierarchies. The method is based on sequence similarities, bi-directional best hit information and some heuristics, and has achieved a high degree of accuracy when compared with the manually curated KEGG GENES database.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Biennial Report on Carcinogens Listing/Delisting Procedure

The number of complete and draft genomes is rapidly growing in recent years, and it has become increasingly important to automate the identification of functional properties and biological roles of genes in these genomes. In the KEGG database, genes in complete genomes are annotated with the KEGG orthology (KO) identifiers, or the K numbers, based on the best hit information using Smith– Waterm...

متن کامل

From Function Prediction to Pathway Prediction: A New Pipeline Based on KAAS and GENIES

The number of complete and draft genomes has increased in recent years. The prediction of precise biological roles of the genes of such sequenced organisms is becoming an important issue in computational biology. We have recently developed two novel systems: KAAS (KEGG Automatic Annotation Server) [2, 4] and GENIES (Gene Network Inference Engine based on Supervised Analysis) [3] as computationa...

متن کامل

KAAS: KEGG Automatic Annotation Server

The number of complete and draft genomes has rapidly increased in recent years, and it has become increasingly important to identify the functional properties and biological roles of genes in these genomes. We have been developing KEGG Orthology (KO) to classify gene functions. In KO, we annotate genes in complete genomes based on best-hit information using Smith-Waterman scores, as well as by ...

متن کامل

A Parsimony Approach to Biological Pathway Reconstruction/Inference for Genomes and Metagenomes

A common biological pathway reconstruction approach -- as implemented by many automatic biological pathway services (such as the KAAS and RAST servers) and the functional annotation of metagenomic sequences -- starts with the identification of protein functions or families (e.g., KO families for the KEGG database and the FIG families for the SEED database) in the query sequences, followed by a ...

متن کامل

CycADS: an annotation database system to ease the development and update of BioCyc databases

In recent years, genomes from an increasing number of organisms have been sequenced, but their annotation remains a time-consuming process. The BioCyc databases offer a framework for the integrated analysis of metabolic networks. The Pathway tool software suite allows the automated construction of a database starting from an annotated genome, but it requires prior integration of all annotations...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 35  شماره 

صفحات  -

تاریخ انتشار 2007